Job Radar. Live notifications. AI processed.
upwork.com 2026-04-16 🟡
🔹 Refactor Raw Job Descriptions into a Structured Technical Manifest (JSON)
👤 Client: 🇮🇱 Israel Member since 2021-09-22
💰 Price: $800
🚩 Problem: Automate web scraping to handle high API call volume from Cloudflare-backed search engine without authentication.
📦 Existing: [Current Tech: Python, BeautifulSoup, Requests]
Specifications:
[Target] Automate data extraction from an open search engine with Cloudflare protection.
[Method] Implement proxy rotation and rate limiting to handle API call volume efficiently.
[UI/UX] Not specified
[Stack] Python, Scrapy, Selenium (for dynamic content), Proxy services
[Security] Ensure compliance with scraping laws; use proxies to avoid IP blocking.
[Format] JSON
Workflow:
1. Analyze the target search engine's structure and identify key data points.
2. Set up proxy rotation using a service like ScraperAPI or ProxyMesh.
3. Implement rate limiting in Python script to avoid hitting API limits.
4. Use Scrapy for structured scraping if static content, Selenium for dynamic content.
5. Test the scraper with a small dataset and refine as necessary.
6. Deploy the scraper and monitor its performance.